Durational Evidence for Syllable Boundary of /n/ and /l/ in Text-to-Speech Synthesis
نویسنده
چکیده
The Text-to-Speech (TTS) system does rely on syllable boundary information for segmental duration. However, ambisyllabic consonants always pose a problem to TTS because the system requires clear syllable boundaries to segment and concatenate. In order to provide a possible solution to this problem, /n/ and /l/ in VLCAVR are chosen in this paper as the target to be examined whether their durations behave more like the syllabic onset or coda when comparing with the durational properties of /n/ and /l/ both as onsets in CV and codas in VC. As the syllable boundaries, onset C shows much more sensitivity to stress than coda C while coda C shows more sensitivity to syllabic position than onset C. Moreover, CA in VLCAVR is also influenced by two variables of stress and position as C in CV and VC. The results show that the intervocalic CA holds the properties of both the syllabic onset and coda, which states the possibility that intervocalic consonants should be considered as a rather independent concatenative unit in TTS synthesis.
منابع مشابه
مراحل و نحوه ی تهیه ی دادگان های صوتی هجایی و دایفونی برای سامانه ی تبدیل متن به گفتار فارسی
Abstract Speech databases are part of the concatenative text to speech synthesis systems. Phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two syllable and diphone speech databases for Persian and investigates the way of their development and their specifications and their advantages to each other. ...
متن کاملDuration and Pauses as Cues to Discourse Boundaries in Speech
Duration is a primary factor both to achieve more naturalsounding synthesis and as an indicator of phrasal organization in speech recognition. In this study, we investigate pauses and durational patterns in spontaneous conversation, as well as how reliably such elements can serve as boundary-marking predictors across different types of speech corpora. Our results show that pause duration is sig...
متن کامل[hal-00463205, v1] Durational Cues and Prosodic Phrasing in French: Evidence for the Intermediate Phrase
Studies addressing prosodic constituency in French generally agree on two levels of phrasing (accentual phrase, AP, and intonation phrase, IP), while the existence of an intermediate level of phrasing (intermediate phrase, ip) is still controversial. In this study we examine durational cues in a read speech corpus at normal and fast rates in which the target syllable was either adjacent to a pr...
متن کاملDurational Cues and Prosodic Phrasing in French
Studies addressing prosodic constituency in French generally agree on two levels of phrasing (accentual phrase, AP, and intonation phrase, IP), while the existence of an intermediate level of phrasing (intermediate phrase, ip) is still controversial. In this study we examine durational cues in a read speech corpus at normal and fast rates in which the target syllable was either adjacent to a pr...
متن کاملDevelopment of Concatenative Syllable based Text to Speech Synthesis System for Tamil
This paper addresses the problem of improving the intelligibility of the synthesized speech in Tamil TTS synthesis system. The human speech is artificially generated by Speech synthesis. The normal language text will be automatically converted into speech using Text-to-speech (TTS) system. This paper deals with a corpus-driven Tamil TTS system based on the concatenative synthesis approach. Conc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of Multimedia
دوره 8 شماره
صفحات -
تاریخ انتشار 2013